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ABSTRACT 


The  ability  of  operators  to  detect  step  changes  In  the  order  of  control 
dynaailcs  Is  Investigated  as  a Joint  fimctlon  of  a)  participatory  node:  whether 


subjects  are  actively  controlling  those  dynasdcs  or  are  monitoring  an  autopilot 


controlling  them,  and  b)  concurrent  task  workload.  A theoretical  analysis  of 


detection  In  the  two  modes  Identifies  factors  that  will  favor  detection  In 


either  mode.  Five  subjects  either  tracked  or  monitored  the  system  dynamics 


on  a 2-dlmenslonal  pursuit  display  under  single  task  conditions  and  concurrently 


with  a “subcrltlcal"  tracking  task  at  two  difficulty  levels.  Latency  and  accuracy 


of  detection  were  assessed  and  related  through  a speed-accuracy  tradeoff  repre- 


sentation. Detection  performance  was  faster,  and  only  slightly  less  accurate 


In  the  manual  as  opposed  to  the  autopilot  mode  and  performance  In  each  mode  was 


derogated  by  the  concurrent  tracking  requirement,  but  not  by  Increases  In  loading 


task  difficulty.  Further  analysis.  Involving  multiple  regression  techniques 


ensemble  averaging  and  examination  of  response  latency  distributions  suggested 


that  manual  superiority  was  attributable  to  the  additional  proprioceptive  Infor- 


itlon  resulting  from  control  adaption  to  the  system  change.  The  effects  of  the 


loading  task  on  detection  and  upon  primary  task  tracking  were  Interpreted  In 


tatms  of  the  concept  of  limited  processing  resources 
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INTROri CTION 

Over  the  past  decade,  the  aviation  industry  has  witnessed  a gradual  change 
in  the  role  of  the  pilot  in  the  cockpit.  Many  traditional  pilot  functions  have 
been  replaced  by  on-board  computers,  and  in  some  Instances  the  pilot  is  no  more 
than  a supervisor  [1]  or  monitor  of  automatically  controlled  functions.  One 
task,  however,  that  remains  of  critical  Importance  to  the  operator  of  any  avia- 
tion system,  whether  he  is  removed  from  the  control  loop  or  not,  is  that  of 
monitoring  all  facets  of  aircraft  performance  for  the  occurrence  of  failures  or 
malfunctions.  The  relatively  low  frequency  of  occurrence  of  such  events  does 
not  diminish  the  importance  of  failure  monitoring  and  detection,  because  the 
consequences  of  an  undetected  malfunction,  or  one  that  is  detected  after  an 
unnecessary  delay,  can  be  disastrous,  potentially  resulting  in  the  loss  of  the 
aircraft  or  of  human  life.  It  can  be  argued  in  fact  that  one  criterion  that 
should  be  used  in  considering  whether  a pilot  should  remain  in  the  control  loop 
under  particular  conditions  is  his  relative  sensitivity  to  system  malfunctions 
in  the  two  modes  of  participation. 

Young  [2]  has  argued  strongly  on  the  basis  of  his  findings  that  the  operator 
is  more  sensitive  to  system  malfunctions  as  an  active  participant  in  the  control 
loop,  than  as  a passive  monitor.  In  his  experiment,  subjects  were  required  to 
detect  various  step  changes  in  system  order  and  gain.  Conditions  were  compared 
in  which  the  subject  was  an  active  controller  and  a passive  monitor  (who  was 
observing  the  compensatory  display  produced  by  another  active  controller). 

Under  these  circumscances  detection  latencies  were  two  to  five  times  greater  for 
the  monitor  than  the  controller.  A «utiood  study  which  also  compared  detection 
ability  in  the  two  modes, however,  resulted  in  contradictory  findings. 

Ephrath  [3]  investigated  failure  detection  performance  in  a two-dimensional 
simulated  landing  task  as  a Joint  function  of  participatory  mode  and  workload.  The  j 

l-  'C  . 

"failures,"  which  in  this  case  were  deviations  Introduced  into  the  flight  path 
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rather  than  changes  In  system  dynamics,  could  occur  in  either  the  pitch  or  yaw 
channel.  Under  different  conditions  sulijects  were  either  in  control  when  a 
failure  occurred  cr  were  monitoring  a nonadaptive  autopilot  in  control  of  that 
channel.  The  non-failed  channel  could  also  be  either  controlled  or  monitored. 
Ephrath's  results  indicated  a clear  superiority  for  detection  on  the  monitored 
as  opposed  to  the  controlled  dimension,  both  in  terms  of  the  smaller  number  of 
missed  failures  and  of  the  shorter  detection  latency.  This  difference  Ephrath 
attributed  in  large  part  to  the  increased  level  of  workload  involved  in  the 
controlled  task. 

Obviously,  in  many  respects  the  studies  of  Young  and  Ephrath  are  not 
comparable.  Young  employed  single-axis  tracking  with  changes  in  system  dynamics, 
while  Ephrath  employed  dual-axis  simulator  control  with  "deviation"  failures 
together  with  a secondary  task  designed  to  measure  workload.  In  addition,  the 
monitoring  conditions  were  different  in  the  two  experiments,  being  influenced 
by  adaptation  in  Young's  study  and  not  In  Ephrath's.  In  this  light,  it  is  not 
surprising  that  the  conclusions  differed  dramatically.  Certainly,  one  of  the 
most  salient  differences  between  these  studies  lies  in  the  contrast  between 
single-  and  dual-axis  tracking  and  is  inherent  in  the  greater  workload  imposed 
in  the  latter  condition. 

While  numerous  other  investigations  of  failure  detection  performance  aru 
present  in  Che  literature  [2,4],  the  studies  of  Young  and  Ephrath  are  the  only 
two  that  have  explicitly  contrasted  detection  between  Che  two  modes,  so  that  a 
direct  comparison  is  possible.  The  present  study  was  conducted  with  Che  intent 
of  clarifying  the  nature  of  the  superiority  relation  between  the  two  modes.  A 
question  of  specific  interest  was  whether  the  difference  in  results  between  Che 
results  of  Ephrath's  and  Young's  study  could  be  attributable  Co  differences  in 
concurrent  Cask  workload  between  the  paradigaw,  and  for  this  reason  secondary 
task  workload  was  manipulated  orthogonally  to  participatory  mode. 
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Theoretical  Analysts  of  Failure  Detect i -m 

The  detection  of  a failure  or  change  in  the  characteristics  of  a dynamic 
system  requires  that  the  detector  have  available  two  basic  elements:  (1)  an 
internal  representation  of  the  state  of  the  normally  operating  system^-the 
expected  value  of  state  variables  and  their  expected  variability  [5,6]  and 
(2)  a channel,  or  set  of  channels,  of  information  concerning  the  current  state 
of  the  system.  Failures  are  detected  when  the  information  concerning  the 
current  system  state  is  assessed  to  be  sufficiently  deviant  from  the  representa- 
tion of  normal  operation  to  warrent  a decision.  The  decision  process  involved 
may  be  assumed  to  involve  the  application  of  some  statistical  decision  rule  [7], 
More  'specifically  it  is  assumed  that  the  detection  process  involves  the 
integration  over  time  of  noisy  evidence  concerning  any  differences  between 
current  and  normal  operation  until  the  decision  criterion  is  exceeded  within 
some  time  period.  Provided  that  a failure  has  ocevred,  the  quality  of  this 
noisy  information  will  Increase  with  the  number  of  information  sources  and  with 
integration  time,  as  long  as  memory  for  the  standard  of  normal  operation  remains 
salient.  Provided  memory  is  salient  then,  failure  decisions  should  become  more 
prevalent  at  longer  latencies.  However  it  is  also  reasonable  to  assert  that, 
if  sufficient  time  has  lapsed  with  the  post-failure  dynamics,  the  internal  model 
of  normal  operation  itself  begins  to  reflect  partially  the  new  dynamics,  and 
thus  the  "strength"  of  the  difference  signal  that  is  Integrated  becomes  attenuated. 
Thus  while  integrated  Information  grows  over  time,  the  diagnostic  value  of  that 
information  will  eventually  decline,  dictating  that  detection  accuracy  (or  number 
of  detections)  will  not  be  monotonically  increasing  with  latency  but  rather 
will  reflect  this  tradeoff,  achieving  a maximum  at  an  intermediate  latency. 

Figure  1 presents  a schematic  comparison  between  the  failure  detection 
process  described  idien  the  operator  is  in  the  control  loop  (top)  and  when  he 
is  removed  from  the  loop,  monitoring  autopilot  control  (bottom).  In  both  modes. 


4 


the  available  channels  of  information  to  the  decision  making  system  concerning 


current  system  state  are  represented.  The  following  theoretical  analysis, 
employing  the  conceptual  framework  described  above  and  represented  symbolically 
in  Figure  1,  will  attempt  to  define  the  characteristics  or  attributes  of  each 


participatory  mode  that  might  be  expected  to  enhance  the  sensitivity  of  failure 
detection  in  that  mode. 


Insert  Figure  1 about  here 


Consistency  of  internal  model  of  dynamics.  An  evolving  conception  in 
control  theory  is  that  the  operator  maintains  an  internal  representation  or 
"model  in  the  head"  of  the  dynamic  system  that  is  being  controlled.  It  is 
assumed  here  that  this  model  provides  the  basis  for  predicting  expected  system 
outputs  in  response  to  known  Inputs  - an  internalized  estimate  of  the  transfer 
function  of  the  system  being  controlled.  This  conception  is  consistent  with 
that  employed  by  Curry  & Gai  [7]  and  Miller  and  Elklnd  (8]  and  others.  With 
respect  to  failure  detection,  a critical  characteristic  of  an  internal  model 
relates  to  its  internal  consistency  or  expected  variability.  For  any  given 
input  to  the  system,  the  range  or  variability  of  expected  outputs  is  a measure 
of  this  consistency. 

It  is  proposed  that  when  the  operator  is  actively  controlling,  the  stability 
of  this  internal  model  is  considerably  greater  than  when  he  is  monitoring.  This 
difference  reflects  the  fact  that,  when  controlling,  the  operator  has  a greater 
involvement  with  the  system,  and  a direct  knowledge  of  its  input-output  character- 
istics available  by  comparing  his  control  inputs  with  the  system  response  {9]. 

This  Information  is  only  available  when  monitoring  if  the  monitored  display  is 
pursuit,  and  even  then,  knowledge  of  control  inputs  is  not  as  precise,  since  a 
system  response  due  to  regulatory  error  correction,  cannot  easily  be  discriminated 
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fron  on*  resulting  from  external  disturbances.  Thus  in  the  control  mode  a 


gaeller  variance  estimate  of  the  normal  state  exists  in  the  decision  center  and 


therefore  detection  should  show  greater  sensitivity  to  departures  from  this 


state  induced  by  changing  dynamics,  than  should  detection  in  the  monitoring  mode 
Information  channels.  A second  attribute  of  the  control  suxle  that  predicts 


superiority  of  failure  detection  is  the  greater  numbr^r  of  channels  of  informs- 
tion  concerning  the  current  state.  When  monitoring,  information  is  provided 
to  the  operator  exclusively  via  the  visual  channel  (system  error  and  its  deri- 
vatives in  the  compensatory  display  or  input  and  outputs  plus  derivatives  in  the 
pursuit  display).  On  the  other  hand,  in  the  control  situation  the  operator 
also  has  available  a proprioceptive  channel  of  information  concerning  his  own 
input  to  the  control  stick,  independent  of  disturbances  acting  upon  the  system. 

Although  control  input  cannot  directly  reflect  the  occurrence  of  failures 
(except  as  failures  initiate  mechanical  feedback  from  the  control  itself),  it 
will  do  so  indirectly  to  the  extent  that  any  compensatory  adaptation  that  the 
operator  initiates  to  a system  change  will  be  reflected  in  a change  in  his 
response  characteristics  (mean  control  position,  velocity  or  acceleration)  and/or 
the  characteristics  of  the  operator's  open-loop  transfer  function.  When  con- 
trolling then,  these  proprioceptive  channels  will  be  available  to  the  detection 
system  to  supplement  the  visual  channels  that  are  available  in  both  monitoring 


and  controlling  (Figure  1) 


While  the  controlling  mode  thus  seemingly  provides  a distinct  advantage 


over  the  sMnitoring  mode  by  virtue  of  its  added  proprioceptive  channel,  it  should 
be  noted  that  this  advantage  is  not  invariably  present  for  reasons  telated  to 
the  non- independence  of  control  input  and  error.  More  specifically.  If  adapta- 
tion to  the  failure  is  rapid  and  complete,  as  may.  occur  for  example  in  response 
to  shift*  in  system  gain  [2],  the  obteined  distribution  of  error  following  the 
chenge  would  show  little  or  no  alteration  from  that  characterising  the  normal 
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operating  state,  while  a change  would  be  manifest  In  the  characteristics  of  the 
control  response  and,  therefore,  the  transfer  function. 

Failure  to  initiate  any  adaptive  control,  on  the  other  hand,  would  leave 
unchanged  the  proprioceptive  Input,  while  altering  both  Che  nature  of  the  error 
distribution  and  again,  the  resulting  transfer  function.  In  short,  whether  or 
not  an  adaptive  response  Is  Implemented,  the  transfer  function  will  change.  If 
adaptation  occurs,  the  response  will  change  as  well.  If  It  does  not,  then  the 
error  distribution  will  be  altered. 

However,  even  provided  with  only  two  sources  of  Information  (transfer  func- 
tion plus  error  or  control  response)  rather  than  three,  a comparison  of  number 
of  channels  still  favors  the  control  mode  over  monitoring.  Assuming  that  there 
Is  some  degree  of  Independence  of  Information  processing  along  the  channels, 
the  probability  and/or  speed  of  detecting  change  Information  along  any  one  of 
two  channels  characterising  the  control  mode,  should  be  greater  than  that  of 
detecting  change  along  the  single  visual  channel  available  in  the  monitoring  mode. 

Differential  senaltlvlty  to  visual  vs  proprioceptive  Information. 

Although  a strict  comparison  of  the  number  of  channels  of  Information  available 
to  a decision  swchanlsm  favors  control  over  monitoring,  an  Important  caution 
should  be  noted.  As  described  above,  the  operator  la  able  to  trade  off  the 
strength  of  the  failure  occurrence  "signal"  along  the  visual  vs  proprioceptive 
channel,  to  the  extent  that  he  engages  in  some  degree  of  compensatory  adaptation. 


As  adaptation  Increases,  proprioceptive  "signal  strength”  inci 


at  the 


expanse  of  visual  error  "signal  strength."  Thus  the  prediction  based  upon  the 
difference  in  number  of  information  channels  — that  control  detection  will  be 
superior  to  sranltoring  detection  — is  predicated  upon  the  assumption  that 
detection  of  change  is  equally  efficient  along  all  the  channels  (proprioceptive, 
transfer  function,  and  visual).  Xn  other  words  this  approach  assiaas  thee, 
whichever  channals  are  employed  in  the  control  mode,  their  Joint  signal  will  be 
■ore  easily  detected  than  the  single  visual  signal  in  the  monitoring  mode. 


Mitlgatiag  against  this  conclusion,  however,  is  a body  of  literature  in 
psychology  suggesting  that  the  sensitivity  to  proprioceptive  information  is 
reduced  relative  to  visual  inforraation  particularly  when  the  two  sources  are 
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available  at  the  same  time  and  are  conveying  conflicting  Information  te.g.,  10, 

11,  12].  Such  a conflict,  in  fact,  describes  precisely  the  situation  in  which 
an  operator  has  successfully  adapted  to  a change  in  control  dynamics.  Under 
these  circumstances,  the  visual  error  channel  is  providing  information  describing 
normal  operation  (since  the  appropriate  gain,  or  lead-lag  adjustment,  has  pre- 
sumably been  initiated  to  restore  the  original  open-loop  transfer  characteristics), 
wttile  the  less  sensitive  kinesthetic  channel  conveys  the  information  that  a 
change  has  in  fact  been  implemented.  The  predicted  consequence  of  this  conflict 
situation  is  that  the  operator  will  be  leas  likely  to  detect  the  change 
he  would  had  no  adaptation  baen  achieved,  the  latter  condition  of  course  pro- 
ducing a visual  signal  equivalent  to  the  monitoring  mode.  McDonnell  [13],  in 
fact,  has  noted  anecdotally  such  Instances  in  which  successful  adaptation  has 
been  coupled  with  the  failure  to  detect  dynamic  system  changes. 

Workload  differences.  A second  characteristic  of  the  manual  control  mode 
that  predicts  a reduced  sensitivity  to  the  occurrence  of  failures  relates  to  the 
greater  workload  imposed  by  tracking  than  by  monitoring.  Numerous  examples  may 
be  cited  from  behavioral  literature  that  demonstrate  the  attention  demands  of 
purely  perceptual  tasks  sudi  as  monitoring  to  be  less  than  those  of  tasks  such 
as  tracking  in  which  a requirement  for  the  selection  and  execution  of  responaes 
is  also  imposed  [14,  IS].  This  finding  is  verified  as  well  in  a direct  compari- 
son of  controlling  vs  autopilot  monitoring  in  the  simulator  [16].  In  the  frame- 
work of  the  present  analysis,  if  monitoring  for  and  responding  to  failures  is 
regarded  as  a "task”  separata  from  tacking,  then  since  the  operator's  attentional 
resources  are  limited,  the  greater  workload  demands  imposed  in  the  control  mode 
than  in  the  monitoring  mode  would  predict  poorer  perfonaanca  on  the  added  "task" 
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of  failure  detection  in  the  former  condition. 

Whereas,  workload  differences  make  a clear  prediction  of  detection  differences 
in  Che  single-task  environment,  this  prediction  is  not  as  apparent  when  the  per- 
formance of  additional  casks  is  required.  A coonon  result  emanating  from  suich 
dual-task  research  is  that  casks  that  are  in  themselves  simpler  or  less  loading 
are  at  the  same  time  more  vulnerable  to  performance  decrements  in  a dual-task 
environment,  as  more  demanding  paired  casks  ''capture"  a greater  proportion  of 

I 

available  attcntional  resources  [17].  In  the  current  context,  monitoring  - the 
simpler  task  - should  be  more  vulnerable  to  additional  dual-cask  requlreswnCs 
chan  controlling. 

Furthermore,  Co  the  extent  that  failure  detection  while  cracking  is 
dependent  upon  the  processing  of  information  integral  to  the  tracking  task, 

Chen  the  quality  of  this  Information  — and,  therefore,  the  quality  of  detection 
itself  — will  be  preserved  as  tracking  perfonaance  is  guarded  in  the  face  of 
competing  secondary  task  demands.  In  contrast,  Che  quality  of  visual  infonsation 
available  in  monitoring  will  be  predicted  by  this  view  to  deteriorate;  rendering 
detection  while  monitoring  more  vulnerable  to  loading  than  while  controlling. 
Summary 

The  implications  of  the  preceding  theoretical  analysis  ara  complex.  In 
summarieing,  two  attributes  of  the  controlling  mode  may  be  identified  that 
would  seemingly  facilitate  failure  detectionl  a greater  eonsiatency  of  the 
internal  model  of  the  system,  and  a greater  number  of  channels  available  upon 
which  to  base  failure  detection  daeisiona.  At  the  same  time,  the  Utter  adxantage 
may  be  mitigated  to  the  extent  that:  (a)  odaptatiom  takaa  place  reducing  the 
strength  of  a visual  error  signal  and,  (b)  proprioceptive  sensitivity  is  less 
than  visual.  In  comparison  the  monitoring  mode  is  also  characterised  by  two 
attributes  that  could  facilitate  detections:  a greater  "strength"  of  the  visual 


signal  (if  adaptation  by  an  autopilot  does  not  take  place)  and  a lower  level 
of  workload . 
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Finally,  it  is  argued  that  any  advantage  of  monitoring  over  controlling 
attributable  to  workload  differences  might  itself  be  dissipated  as  the  compe- 
tition for  attentional  resources  is  increased  by  imposing  concurrent  tasks. 
Clearly  this  interplay  of  factors  is  sufficiently  complex  to  prohibit  precise 
predictions  concerning  the  superiority  of  one  mode  over  the  other.  It  does, 
however,  facilitate  a clearer  identification  of  the  nature  of  the  failure 
detection  task  and  allows  predictions  to  be  formulated  concerning  the 
differential  effect  of  variables  such  as  workload  or  control  adaptation  on 
detection  performance. 

In  the  following  experiawnt,  independent  variables  of  participatory  mode 
and  task  workload  were  manipulated  to  detensinc  their  effect  on  detection. 
Analysis  techniques  were  then  employed  in  an  effort  to  identify  further  the 
nature  of  the  processes  operating  In  detection  perfonsance. 


METHOD 

Subjects 

The  subjects  were  five  right-handed  male  university  students  enrolled  in 
basic  flight  training  courses  at  the  Institute  of  Aviation.  Subjects  were  paid 
at  a rate  of  $2.50  per  hour. 

Apparatus 

The  basic  experimental  equipment  included  a 3 x 4 inch  Hewlett  Packard 
Model  1300  CET  display,  a spring-ceatared,  dual-axis  tracking  hand  control 
(with  an  index-finger  trigger)  operated  with  the  other  hand,  and  a Raytheon 
704  16-bit  digital  computer  with  24k  memory  and  A/D,  D/A  interfacing  that  was 
used  both  to  generate  inputs  to  the  tracking  display  and  to  process  responses 
of  the  subjects.  The  subject  was  seated  on  a chair  with  two  arm  rests,  one  for 
the  tracking  hand  controller  and  one  for  the  side-task  finger  controller.  The 
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subject'*  eyes  were  approximstely  112  centimeters  from  the  CRT  display  so  that 

the  display  substended  a visual  angle  of 

Tracking  tasks.  The  primary  pursuit-tracking  task  required  the  subject  to 

match  the  position  of  a cursor  with  that  of  a target  which  followed  a semi- 

predictable  two-dimensional  path  across  the  display.  The  target's  path  was 

determined  by  the  suimaation  of  two  non-ha tmonically  relate  sinusoids  along  each 

axis.  The  frequencies  were:  X-axis,  .08  and  .05;  Y-axis,  .08  and  .05.  The 

position  of  the  following  curaor  was  controlled  Jointly  by  the  subject's  control 

response  and  by  a band- limited  forcing  function  with  a cutoff  frequency  of  .32 

Hs  for  both  axes.  Thus  the  two  inputs  to  the  system  were  well  differentiated  in 

terms  of  predictability,  bandwidth,  and  locus  of  effect  (target  vs  cursor).  The 

control  dynamics  of  the  tracking  task  were  of  the  form  Y ■ ^ - + — ? — for 

c ® ^ 
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each  axis,  where  a was  the  variable  parameter  used  to  introduce  changes  in  the 
system  dynamics.  These  changes,  or  simulated  failures,  were  introduced  by  step 
changes  in  the  acceleration  constant  a from  a normal  value  of  .3,  a mixed  velocity 
and  acceleration  system  with  a high  weighting  on  the  velocity  component,  to 
a " .9,  a system  that  approximates  pure  second  order  dynamics. 

As  the  loading  task,  the  Critical  Task  [18],  was  employed.  This  was  displayed 
horisontally  at  the  bottom  of  the  screen  and  required  the  subject  to  apply  force 
to  the  spring-loaded  finger  control  in  a left-right  direction  to  keep  the  un- 
stable error  cursor  centered  on  the  display.  The  value  of  the  instability 

k X 

constant  X in  the  dynamics  Y was  set  at  a constant  subcritical  value. 

c s-x 

TWO  values  (X  " .05  and  X **  1.0)  were  employed  on  different  dual  task  trials. 
Experimental  Task 

Subjects  participated  in  five  experimental  sessions  of  which  the  first 
two  were  devoted  entirely  to  practice  on  the  tracking  and  detection  tasks,  and 
the  last  three  used  to  generate  the  experisiental  data.  During  the  first  practice 
day  tha  subject  performed  only  the  tvo-diawnsional  pursuit  tracking  task.  Xn 
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the  manual  (MA)  condition  the  subject  p-rformed  the  tracking  manually  while 
in  the  autopilot  (AD)  condition,  his  role  in  the  control  loop  was  replaced  by 
a simulated  autopilot  control  dynamics  consisting  of  a pure  gain  and  effective 
time-delay.  The  open  loop  gain  was  set  at  a constant  value  for  all  subjects, 
and  Che  time  delay  value  was  adjusted  for  each  subject  ho  obtain  an  error  measure 
in  Che  AU  condition  equivalent  to  the  operator's  performance  in  the  MA  condition. 
This  value  of  time  delay  was  maintained  throughout  the  rest  of  the  experiment. 
Each  trial,  MA  or  AU,  lasted  130  seconds. 

To  give  the  subjects  some  experience  with  the  failed  condition  (i.e.,  Che 
higher  acceleration  in  the  control  dynamics),  the  subject  received  two  trials 
(one  AU  and  one  MA)  in  which  he  tracked  (or  viewed  the  autopilot  Cracking)  only 
the  failed  dynamics.  Two  demonstration  trials  were  then  presented  in  which  the 
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subject  tracked  in  the  regular  condition,  but  the  onset  of  each  failure  was  cued 
by  Che  presentation  of  a "F"  on  Che  screen.  The  subject  was  instructed  to  press 
Che  trigger  to  return  the  system  to  normal  only  upon  the  detection  of  the  nature 
of  the  change.  This  training  period  was  then  followed  by  8 regular  detection 
trials  (4  AU,  4 MA  in  alternating  order).  Each  trial  contained  either  4 or  6 
failures  so  that  a total  of  20  failures  were  presented  in  each  suxic. 

The  presentation  of  the  failure  was  generated  by  an  algorithm  chat  assured 
random  intervals  between  presentations  and  allowed  the  subject  sufficient  time 
to  establish  baseline  tracking  perfornwncc  before  the  onset  of  the  next  change. 
Task  logic  also  insured  that  changes  would  only  be  intrdduced  when  system  error 
was  below  a criterion  value.  In  the  absence  of  this  latter  precaution,  changes 
would  soaMtimes  introduce  obvious  "jumps"  in  cursor  position. 

During  these  detection  trials,  the  detection  decision  was  recorded  by 
pressing  the  trigger  on  the  control  stick.  This  response  presented  a "T"  on 
the  screen  and  returned  the  systeai  to  normal  operating  conditions  via  a four- 
second  ramp  to  the  prefailure  dynsmlcs.  If  the  subject  failed  to  detect  the 
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change,  the  ayatein  returned  to  noinal  rfter  aix  aeconda.  Thla  vaa  an  interval 


within  which  it  waa  asauaied,  on  the  baeia  of  preteat  data,  that  reaponaea  would 
corrcapond  to  detected  failurea  and  not  to  falae  alarma.  The  aubjecta  were  told 
to  detect  aa  many  changea  aa  poaaible  aa  quickly  aa  poaaible. 

On  the  aecond  day  (dual«taak  training)  the  aubject  performed  the  primary 
tracking  taak  together  with  a aide  task,  the  Critical  Task.  After  a refresher 


trial  in  the  MA  mode,  the  awMect  received  a series  of  training  trials  to  practice 
the  side  task,  first  in  the  AU  and  then  la  the  MA  mode.  When  acceptable  criteria 
were  achieved  in  the  Critical  Task  and  MA  tracking  individually,  the  aubject  then 
carried  out  these  tasks  together  with  the  failure  demonstrations,  as  described 


above 


Eight  more  experimental  trials  were  then  presented  in  which  the  subject 
performed  all  three  tasks  (tracking  or  monitoring.  Critical  Task,  and  failure 
detection).  Two  trials  were  presented  in  each  mode  at  each  level  of  Critical 
Task  difficulty  (1  - 0.5,  1.0).  The  subject  was  instructed  to  "do  the  side- 
stick  taak  aa  efficiently  and  accurately  aa  possible."  The  instructions,  there- 
fore, clearly  defined  the  side  task  as  the  loading  task  while  allowing  perform- 
ance on  the  tracking  and  detection  tasks  to  fluctuate  in  response  to  covert 
changes  in  available  attentional  resources.  These  instructions  were 
emphasised  by  providing  subjects  with  trial  by  trial  feedback  on  critical 
task  performance.  In  this  manner,  workload  demands  were  experimentally  mani- 
oulated.  rather  than  being  passively  assessed. 

Following  the  two  training  days,  the  final  three  days,  used  to  generate  the 
data  for  experimental  analysis  followed  the  format  of  Table  1.  The  order  of 
presentation  of  the  12  experimental  trials  waa  counter-balanced  across  subjects 
and  across  days  within  a subject.  The  task  logic,  instructions,  and  experimental 
procedure  was  otharwise  identical  to  that  on  days  1 and  2. 
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Insert  Table  J about  here 


ANALYSIS 

Assuaptlons  from  signal  detection  theory  [19]  were  employed  to  account  for 
detection  performance  In  terms  not  only  of  the  proportion  of  failures  detected 
(hit  rate)  • but  also  the  nuiid>er  of  detection  responses  made  In  the  absence  of 
failures  (false  alarms).  The  signal  detection-based  sensitivity  Index  reflects 
changes  In  both  of  these  values.  Some  modification  of  classical  signal  detection 
analysis  procedures  was  required  because  of  the  undefined  nature  of  the  response 
interval  [20].  According  to  this  procedure  It  Is  necessary  Initially  to  specify 
the  Interval  following  each  failure  signal  to  be  designated  as  a "hit"  Interval. 

The  data  from  a number  of  pretests.  In  which  dynamics  did  not  return  to  the 
pre-failure  level,  Indicated  that  the  distribution  of  subject  responses,  following 
signal  occurrence,  showed  a peak  at  around  three  seconds  and  reached  a relatively 
stable  baseline  by  six  seconds  following  a failure.  Therefore,  six-second  Inter- 
vals were  defined  as  hit  Intervals,  and  the  measure  F(HIT)  was  simply  the  number 
of  detection  responses  falling  within  the  Interval  divided  by  the  total  number 
of  intervals.  The  remaining  duration  of  the  trial  was  similarly  subdivided  Into 
six-second  false  alarm  Intervals.  The  measure  P(FA)  was  computed  as  the  number 
of  false  alarms  divided  by  the  number  of  false-alarm  Intervals. 

Because  of  the  relatively  small  number  of  signals  presented,  and  the  ques- 
tionable applicability  of  the  formal  signal  detection  theory  assumptions  to  the 
current  data,  the  nonparametric  measure  of  the  area  under  the  ROC  curve,  P(A), 
was  employed  as  the  bias-free  swasure  of  sensitivity  [19].  Values  to  this 
measure  were  computed  from  the  P(HIT)  and  P(FA)  data  by  reference  to  tables  In 
McMicol  [21].  This  measure  produces  a score  varying  from  0 to  1.0  for  which 
0.5  represents  chance  performance  and  1.0  represents  perfect  accuracy.  Both 
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the  F(A)  measure  and  Che  mean  and  sCand.'ird  deviation  of  detection  latencies 
were  computed  at  Che  end  of  each  trial. 

TrackioR  performance.  Tracking  measures  of  vector  error,  vector  control 
position,  and  Critical  Task  error  were  sampled  every  60  msec  and  stored  on 
digital  tape  for  later  data  analysis.  Error  and  control  position  were  also 
differentiated  to  obtain  their  respective  velocity  values.  In  addition,  on  a 
fourth  channel,  the  occurrences  of  failures  and  responses  were  recorded.  At 
the  end  of  each  trial,  Che  RMS  vector  error  on  the  primary  task  and  RMS  error 
on  the  Critical  Task  (if  performed)  were  computed. 

Results 

Detection  Performance 

Table  2 presents  the  average  hit  and  false  alarm  probabilities  for  each 
subject  in  each  of  the  6 conditions.  Following  the  procedure  outlined  above, 
these  were  converted  to  F(A)  measures,  and  this  bias-free  accuracy  variable, 
plotted  as  a joint  function  of  detection  latency  and  condition,  is  shown  in 
Figures  2 and  J.  Because  considerable  experimental  literature  Indicates  that 
adding  a secondary  task,  and  Increasing  its  difficulty  may  have  qualitatively 
different  effects  on  primary  Cask  performance,  [22,  23}.  Figure  2 contrasts 
single  with  dual-task  detection  (the  mean  values  of  the  two  critical  task 
conditions) , while  Figure  3 shows  the  effect  on  detection  of  the  Critical  Task 
difficulty  level  (a.g. , 1 ■ .5  vs.  X ■ 1). 


Insert  Table  2 and  Figures  2 and  3 about  here 


The  rationale  for  Che  joint  speed-accuracy  representation  of  Figures  2 and 
3 Is  that  these  two  variables  represent  different  manifestations  of  an  underlying 
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Table  2 

Detection  Data 


L 

MA 

AU 

r 

Sub iect 

Single  task 

X»  .5 

X-  1.0 

Single  task 

X ■ .5 

X - 1.0 

L 

C.W.  P(H) 

.406 

.267 

.267 

.733 

.700 

.467 

[ 

P(FA) 

.071 

.060 

.080 

.130 

.100 

.200 

[ 

E.L.  P(H) 

.767 

.533 

.567 

.833 

.533 

.533 

[ 

P(FA) 

.050 

.150 

.120 

.120 

.190 

.150 

r 

R.S.  P(H) 

.467 

.433 

.567 

.533 

.458 

.423 

li 

i’ 

P(FA) 

.270 

.200 

./l20 

.280 

.166 

.255 

1. 

T.O.  P(H) 

.267 

.375 

.200 

.533 

.700 

.429 

L 

P(FA) 

.110 

.133 

.170 

.120 

.110 

.186 

i: 

M A.  P(H) 

.500 

.400 

.167 

.833 

.800 

.750 

(■ 

P(FA) 

.075 

.075 

.073 

.058 

.090 

.090 

1 ■ 


3. 


Latency  (sec) 


Figure  2;  Effect  of  pertlclpetory  aode  and  secondary  taak  performance  on 
detection  accuracy  and  latency.  Mean  subject  trend  (heavy 
lines).  Individual  subjects  (thin  lines). 


Accuracy  P(A) 


Critical  Task 

Easy  Difficult 

X=.5 


Figure  3:  Effects  of  parclclpetory  aode  end  eecondery  task  difficulty 
on  detection  accuracy  and  latency. 


performance  metric.  In  any  effort  to  compare  "performance”  across  conditions. 


the  Joint  implications  of  speed  and  accuracy  must  be  taken  into  account  [24,  25] 


For  example,  a condition  that  produces  a high  accuracy  of  responding  might  do  so 
at  such  a prolonged  latency  that  the  utility  of  that  decision  in  a real-world 
context  is  less  than  that  of  a more  rapid  decision  with  slightly  lower  expected 


accuracy.  Furthermore  the  underlying  model  as  proposed  in  the  introduction 


that  information  is  integrated  over  time  until  a decision  criterion  is  reached 


suggests  that  speed  and  accuracy  may  be  "traded  off"  by  manipulating  the 


decision  criterion.  This  tradeoff  presents  another  Justification  for  this 


Joint  representation 


In  Figures  2 and  3,  "good"  performance  (fast  and  accurate)  is  represented 


in  the  upper  left  hand  regions,  while  poor  performance  ia  in  the  lower  right 


In  an  orthogonal  direction,  shifts  in  bias  for  speed  vs.  accuracy  correspond 


to  moveswnt  between  the  lower  left  (speed  bias)  and  upper  right  (accuracy  bias) 


in  the  space.  These  shifts  may  be  inferred  to  relate  to  variations  in  setting 


of  the  decision  staking  criterion.  The  lighter  vectors  portray  the  data  of  the 


five  subjects,  while  the  heavier  ones  represent  the  mean  trend,  with  each  end 


of  the  vector  corresponding  to  the  bivariate  mean  of  the  five  individual  subject 


data  points.  Because  of  the  importance  of  viewing  individual  subject  data. 


the  brackets  below  represent  the  average  aiagnitude,  across  subjects  of  > 1 


standard  error  confidence  estiisates  along  both  the  latency  and  accuracy  axes 


By  this  representation,  it  is  possible  to  view  sisuiltaneously  the  trend  of  suh 
Ject*s  behavior  in  the  speed -accuracy  spaca,  the  extent  to  which  the  trend 


typifies  the  behavior  of  all  subjects  and,  through  the  confidence  brackets  the 
reliability  of  the  trends  shown  by  Individual  subjects. 

Undoubtedly  the  swst  noteworthy  effect  in  Figure  2 is  the  increase  in 


response  latency  from  the  N4  to  the  AU  condition  (t  • 5.84,  £ < .001).  Vhila 


this  increase  in  latency  for  the  AU  awde  is  consistent  for  all  subjects,  it  is 
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accompanied  by  an  increaae  in  reaponse  .iccurately  that  is  less  pronounced  and 
is  only  evident  in  both  single  and  dual  task  conditions  for  three  of  the  five 
subjects  (C.W.,  T.O.,  and  H.A.)  Thus  the  prevailing  trend  induced  by  shifting 
from  the  MA  to  AU  mode  appears  to  be  a shift  in  bias  to  the  upper  right  in  the 
speed  accuracy  apace:  towards  slightly  more  accurate,  but  considerably  slower 
detection. 

The  chance  in  performance  induced  by  the  requireoient  to  perform  the  addi- 
tional Critical  Task  (Figure  2)  is  in  the  direction  of  poorer  performance: 
increased  latency  and  decreased  accuracy.  This  is  a trend  predictable  from  the 
assumption  of  limited  operator  processing  resources.  The  mean  vectors  indicate 
that  this  trend  is  of  about  the  tame  magnitude,  and  in  the  same  direction  for 
both  the  AU  and  MA  modes,  although  it  is  considerably  less  consistent  across 
subjects  in  the  MA  condition,  with  two  subjects  (R.S.  and  H.A.)  showing  vectors 
that  do  not  run  in  the  predicted  direction.  R.S.  shows  an  accuracy  increase, 
and  H.A.  a latency  decrease  with  Critical  Task  performance. 

In  Figure  3,  presenting  the  effect  of  increasing  Critical  Task  difficulty 
on  detection  performance,  the  mean  trend  in  both  modes  shows  a decrease  in 
accuracy.  However,  this  la  only  consistent  across  subjects  in  the  AU  mode 
(with  minor  exception  of  E.  L.  who  shows  a minimal  accuracy  increase).  Further- 
more, in  both  aK>dcs  the  increase  in  Critical  Task  difficulty  appears  to  lead 
to  a slight  decreuae  in  average  response  latency.  Subjects  M.A.  (MA  mode  only) 
and  C.W.  are  exceptions  here. 

While  the  trends  in  neither  data  set  of  Figure  3 are  striking,  the  finding 
of  interest  is  perhaps  the  very  fact  that  trends  are  not  pronounced  when  they 
might  otherwise  have  been  expected  with  increased  loading  task  difficulty.  The 
reason  for  this  expectation  is  evident  from  Figure  4 which  presents  the  effects 
of  the  workload  aainipulation  on  tracking  performance.  It  can  be  seen  here  in  the 
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v.,> 


MA  mode  that  tracking  performance  deteriorates  equally  as  the  critical  task  is 


added  (t  ••  3.05,  £ < .01)  and  as  its  difficulty  Increases  (t  ■ 1.71,  jg  < .05) 
(the  manipulations  corresponding  to  those  portrayed  in  Figures  2 and  3 respec- 
tively). Even  though  there  is  a slight  increase  in  Critical  Task  error  with  X 


it  is  safe  to  conclude  that  increasing  Critical  Task  difficulty  leaves  fewer 
processing  resources  available  to  devote  to  performance  of  the  tracking  task. 
Yet  in  the  MA  mode,  this  diminuition  of  resources  does  not  appear  to  reduce 
detection  performance  in  any  marked  fashion  for  other  than  subject  MJk. 
Similarly  in  the  AU  mode,  the  trend  of  detection  performance  with  increasing  X 
can  not  be  described  as  an  unambiguous  decrease  in  performance,  but  rather 
as  a shift  in  bias  to  fast  inaccurate  responding  (Figure  3). 


Insert  Figure  4 about  here 


Information  Utilised  in  Detection 


In  the  introduction,  two  hypotheses  were  proposed  to  predict  why  detection 

might  be  superior  in  the  MA  over  the  AU  srade,  and  two  were  proposed  predicting 

AU  superiority.  The  results  presented  in  Figures  2 and  3 and  suggest  that 

« 

manual  detection  was  generally  superior,  and  the  data  were  analysed  in  detail 
to  determine  their  consistency  with  the  second  hypothesis  proposed  for  manual 
superiority  i.e 


, the  role  of  the  added  proprioceptive  channel 
two  analysis  techniques  were  pursued  to  identify  thi  cues  employed  and  provide 
insight  into  the  nature  of  the  detection  process.  (1)  Ensemble  averages  of 
display  end  control  variables  were  const meted  primarily  to  determine  the 
existence  of  failure  "signals'* t time  varying  characteristics  of  the  variables, 
time-lockad  to  failures  and  increasing  during  the  post-failure  interval 
(Figure  S) •Separate  averages  were  eonstmetad  for  hit  and  miss  trials  in  each 
condition.  (2)  Multiple  regression  techniques  were  employed  to  datermina  what 
characteristics  of  the  signal  and  rasponsa  were  the  best  predictors  of  detection 


In  this  endeavor 


ftfuc*  4 1 If  faces  of  Ccltlecl  Tack  dUfloulcy  on  parforaiaea  of  priauunr 
traeklag  and  of  CriCteal  Task.  Subjact  naans  (top). 
IndlTldual  sub j acts  (botton)* 


latency.  As  predictor  variables  error,  error  velocity,  control  velocity  and 
cursor  velocity  were  sampled  at  the  Instant  of  failure,  at  .6,  1.2,  1.8,  2.4, 
3.6,  and  4.8  seconds  after  the  failure.  The  results  of  this  analysis  are 
presented  in  Table  3.  The  results  of  these  two  techniques  will  be  discussed  as 
they  bear  upon  the  question  of  the  cues  utilized  for  detection. 


Insert  Figure  5 and  Table  3 about  here 


Detected  Failures 
Missed  Failures 


Absolute 


Absolute 


Absolute 

Control 


Latency  otter  Failure  a , it,, 

(eeconds) 
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Tnble  3 


26 


I 

li 

■I! 


Multiple  Regression  on  Response  Latency* 
Condition 
MA 


AU  * 


Variable  Multiple  Partial  Variable  Multiple  Partial 
Name r r Marne  r r 


Order  of 

Prediction 

Variable 


1 

Error 

Error 

1.2  sec. 

.310 

-.310 

.6  sec 

.357 

-.357 

2 

Control 

.670 

-.170 

Error 

.490 

-.309 

vel.  0.6 

Velocity 

.6  sec 

3 

Error 

Error 

Velocity 

.521 

-.251 

Velocity 

.524 

-.213 

1.2  sec 

2.4  sec 

Predictor  variables  were  excluded  from  the  table  if  they  occurred  at  latencies 
equal  to  or  greater  than  the  mean  detection  latencies 
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Is  perfornlng  some  sort  of  control  adaptation  to  the  new  post-failure  plant 
dynamlcs--an  adaptation  directed  to  bring  error  to  Its  prefailure  level. 

According  Co  predictions  of  the  crossover  model  [26],  Che  increase  in  system 
order  produced  by  Che  failure  requires  Che  operator  to  develop  greater  lead, 
differentiate  Che  error  value  and  produce  a response  velocity  of  higher  average 
value.  Thus  Co  Che  extent  that  adaptation  is  carried  out,  control  velocity 
should  increase.  Furthecuiore,  even  if  adaptation  is  not  Che  linear  response 
predicted  by  the  crossover  model,  but  represents  instead  a time-optimal  bang- 
bang  response  [27],  the  later  strategy  should  still  produce  an  increased 
control  velocity. 

In  the  data  presented  in  Figure  5 a distinct  Increase  in  control  velocity 
is  visible  following  failure,  supporting  the  view  that  adaptation  was  carried  out. 
This  observation  supports  Che  hypothesis  that  MA  superiority  is  based  upon  the 
added  proprioceptive  channel  of  information,  since  the  figure  indicates  clearly 
that  this  information  (manifest  in  the  increasing  control  velocity)  is  available 
prior  to  detection,  as  a channel  to  the  decision  maker  of  Figure  1.  A further 
corroberation  of  the  use  of  this  chantml  is  found  in  Che  multiple  regression 
analysis  Chat  identifies  control  velocity  as  the  second  predictor  of  response 
latency.  The  level  of  correlation,  although  modest,  is  found  to  be  consistently 
negative  with  latency,  at  Che  .6  and  1.2  second  post-failure  time  points. 

Latency  distribution.  The  ensead»le  average  and  multiple  regression  analysis 
of  Che  single  task  data  suggest  Chat  more  rapid  MA  deteeCion  may  be  attributed 
to  Che  adaptation-related  proprioceptive  information  channel  that  becomes  available 
to  the  decision  center  within  1-2  seconds  following  the  failure  occurrence.  This 
interpretation  receives  more  direct  support  from  an  analysis  of  the  distribution 
of  response  letencies.  In  the  currant  data,  these  distributions  for  all  MA 
conditions  were  highly  skewed  in  a positive  direction,  while  those  of  the  AU 
conditions  were  approKimetely  sysseetrical.  The  latency  distributions  were 
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transformed  to  cumulative  probability  distributions  portraying  the  relative 
number  or  probability  of  failures  detected,  as  a function  of  latency  after 
failure  (Figure  6).  Lappin  [28]  has  argued  that  a similar  representation  of 
his  reaction  time  data  >•  the  latency  operating  characteristics (sometimes 
referred  to  as  the  cumulative  accuracy  function)  ••  may  provide  evidence  bearing 
upon  the  time-dependent  processes  involved  in  detection:  the  integration  of 
evidence  over  time.  Following  Lappin 's  approach,  it  is  argued  here  that  the 
data  of  Figure  6 may  be  interpreted  as  follows:  Since  detection  latencies  are 
assumed  to  reflect,  in  part  the  instant  at  which  the  sampled  evidence  exeedes 
the  decision  criterion,  the  extent  to  which  accuracy  increases  as  the  criterion 
moves  to  longer  latencies  (through  between  and  within  subject  variation)  is  a 
reflection  of  the  rate  of  accumulation  by  the  decision  center  of  failure-related 
information  in  the  post-failure  interval.  Thus  the  slope  of  a function  of 
relative  accuracy  vs.  latency  represents  the  rate  at  which  perceptional  evidence 
becomes  available,  while  the  level  of  the  function  or  intercept  represents  the 
overall  quality  of  that  infdrmation. 


Insert  Figure  6 'about  here 


According  to  this  interpretation,  three  important  characteristics  are 
evident  concerning  the  single  and  dual  (X  ■ 1.0)  task  data  of  Figure  6t  (1) 

Both  of  the  HA  functions  indicate  the  presence  of  a diatinct  discontinuity  in  the 
rate  of  aceuisulation  of  evidence,  this  discontinuity  occurring  at  approximately 
1-1.5  seconds  post-failure.  In  a manner  consistent  with  the  earlier  diseuasion, 
it  nay  ha  argued  that  the  steeper  early  growth  rate  reflects  the  added  availability 


of  the  proprioceptive  signal  during  the  initial  1-2  seconds  of  adaptation,  while 
the  shallower  slope  following  represents  the  integration  of  evidence  from  the 
visually  displayed  error  signal.  (2)  The  two  AU  traces,  while  not  strictly 
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Latency  after  Failure 
^ (seconds) 

Figure  6:  Cuntletiye  probability  dlatrlbv>tlon  of  detection  iatenclea  for 
alngle  and  dual ^(X  • 1.0)  task  AU  and  HA  condltlona. 
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linear,  fall  Co  show  Che  abrupC  disconCinuiCy  of  Che  MA  condlclons,  and  Chus 
seemingly  rcprcsenC  a uniform  underlying  process.  This  process,  presumably  Che 
InCegraclon  of  displayed  visual  InformaClon  accumulaCes  evidence  ac  a fasCer  race 
(sCceper  slope)  Chan  Is  evldenC  in  Che  laCer,  visual  porClon  of  Che  MA  mode.  This 
InCerpreCaClon  Is  consisCenC  wlch  Che  smaller  display  error  profiles  in  Che  MA 
as  compared  Co  Che  AU  condiCions  (Figure  5),  and  wlCh  Che  assumpClon  Chat  any 
adapcacion  in  Che  MA  condiClon  should  in  facC  reduce  Che  roagnlCude  of  Che  visual 
error  signal.  UiCh  less  InformaClon  Cherefore  available,  Che  InCegraClon  of 
Chis  InformaClon  will  procede  aC  a slower  race.  (3)  In  both  MA  and  AU  modes, 

Che  daca  of  Che  dual  cask  condiClon  lies  below,  buC  closely  parallel  Co  Che 
corresponding  single  Cask  value  (InCercepC  shlfC).  This  suggesCs  Chac  CrlCleal 
Task  performance,  while  affecClng  Che  overall  qualiCy  of  percepCual  daCa  In 
IdenClcal  manner  for  boCh  modes,  does  noc  affecC  Che  race  of  Ics  sampling  or 
acqulsiCion. 

Role  of  Workload.  The  concepC  of  vorkload--Che  cask  Imposed  demand  for  Che 
llmlced  processing  resources  of  Che  operaCor>-i8  relevanC  boCh  as  a poCenClal 
source  of  AU  deCecclon  superlorlCy,  proposed  in  Che  InCroducClon  (buC  noC  shown 
by  Che  daCa)  and  as  iC  concerns  Che  affecC  of  Che  added  requlremenC  of  CrlClcal 
Task  performance  on  deCecCion.  Concerning  Che  flrsC  issue  Ic  is  approprlaCe  Co 
ask  why  che  added  workload  of  eonCrolling  Che  primary  Cracking  Cask  apparenCly 
did  noC  hinder  Che  deCecClon  of  failures  In  Chis  Cask,  relaclve  Co  Che  AU  mode 
when  Chis  conCrolllng  funcCion  was  noC  required  and  Che  subjecc's  only  Cask  was 
Co  moniCor  Che  visual  display  for  failures.  Two  answers  may  be  provided.  (1) 

The  role  of  boCh  error  signal  and  propriocepClve  channels  demonsCraCed  in  MA 
daCecClon  suggesCs  Chac  Che  very  same  menCal  operaclons  Chac  mlghc  on  Che  one 
hand  be  argued  co  increase  che  compeClcion  for  resources  wlch  che  deCecCion 
requlremenC  are  also  Che  same  ones  ChaC  are  inCegrally  involved  in  che  MA 
deCecCion  process.  These  opereclons  Chen  ouiy  funcCion  in  cooperaCion  wich 
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the  detection  process,  rather  than  in  competition  for  the  resources  upon  which 
detection  depends.  (2)  A different  accounting  for  the  absence  of  a workload 
deterioration  effect  can  be  proposed  in  terms  of  the  nature  of  the  processing 
resources  themselves.  While  the  resources  involved  in  the  failure  detection  task 
are  primarily  related  to  perceptual  and  decision  making  mechanisms,  the  added 
resources  required  by  tracking  (as  opposed  to  monitoring)  concern  more  directly 
the  response  mechanisms.  To  the  extent  that  perceptual/decision-making  processes 
and  response  processes  draw  from  different  structural  pools  of  processing 
resources  that  are  not  mutually  available  [22,  23,  29,  30],  it  is  not  expected 
that  large  interference  would  be  evident  between  tracking  and  detection. 

This  argument,  relating  to  the  lack  of  interference  between  structurally 
different  processes  can  also  account  for  the  different  effects  upon  detection 
and  tracking  performance  of  a)  introduction  of  the  Critical  Task  (Figure  2),  and 
b)  increases  in  its  difficulty  (Figure  3).  Critical  Task  introduction  adds  both 
a new  display  element  (demand  for  perceptual  resources),  as  well  as  new  response 
demands  required  by  the  left  hand  control  manipulation  (demand  for  response 
resources).  Thus  the  expected  decrease  in  both  the  concurrent  tracking  perfor- 
mance (Figure  A),  and  in  detection  performance  (Figure  2)  can  be  predicted,  as 
their  respective  processing  resources  arc  both  depleted  by  the  added  perceptual 
and  response  demands.  However,  the  increase  in  X produces  primarily  an  added 
demand  on  the  availability  of  resources  at  the  response  stages  of  processing, 
since  the  perceptual  nature  of  the  Critical  Task  is  liutle  altered  (Critical  Task 
error  changed  little  with  X)>  but  a greater  motor  involvement  is  required. 

To  the  extent  that  the  increase  in  X thus  depletes  response  related  resources, 
but  not  perceptual  ones,  greater  deterioration  will  be  evident  in  the  response 
loading  task  (Tracking)  than  in  the  perceptual  one  (Detection). 


SUMMARY  AND  CONCLUSIONS 


The  major  results  can  be  briefly  summarized  as  follows 


1.  Detection  of  step  increases  in  system  order  when  the  operator  remains 


in  the  control  loop  (MA  mode)  is  considerably  faster  and  only  slightly 


less  accurate  than  when  he  is  removed  (AU  mode).  This  finding  of  MA 


superiority  supports  the  earlier  conclusions  of  Young  (1969) 


2.  The  extent  of  this  superiority  did  not  diminish  as  the  critical  Task 


was  added  or  as  its  difficulty  was  increased  by  raising  the  subcriCical 


value  of  X:  An  interaction  between  participatory  mode  and  workload  was 


not  obtained 


3.  The  effect  of  adding  the  Critical  Task  was  to  reduce  detection  per 


formance  in  both  modes,  but  performance  was  little  altered  with  increasing 


4.  Within  the  framework  of  the  model  presented,  converging  evidence  from 


multiple  regression,  ensemble  averaging  and  latency  distribution  data  was 


presented  suggesting  that  the  cause  of  MA  superiority  was  the  added 


proprioceptive  information,  resulting  from  control  adaptation  and  available 


for  the  first  few  seconds  following  the  failure.  This  information,  when 


coupled  with  the  displayed  visual  information  allowed  a rapid  initial 


aggregation  of  evidence  in  the  MA  mode,  yielding  short  latency  detections 


However  the  availability  of  the  proprioceptive  adaptation  information  was 


short-lived,  due  perhaps  to  the  transient  memory  for  the  proprioceptive 


standard.  Once  gone,  the  now-adapted  visual  error  signal  continued  Co 


provide  evidence  of  lesser  strength,  accumulated  at  a slower  rata  Chan 


Che  non-adapting  AU  condition.  Therefore  the  mean  and  modal  latencies  of 


AU  detections  were  longer,  but  overall  detection  was  slightly  more  accurate 

5.  The  role  of  task  workload  in  affecting  detection  performance  was 
seemingly  only  evident  as  perceptual  load  was  increased  (adding  the  require 
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menC  of  processing  the  Critical  Tisk  display)  and  not  as 
additional  demands  were  placed  upon  the  response  systems,  either 
through  tracking  the  primary  task,  or  through  the  greater  resource 
demands  of  the  increased  loading  task  difficulty. 

6.  Concerning  the  role  of  a more  stable  Interval  model  as  a cause 

of  MA  superiority,  it  can  be  argued  that  this  factor  probably  played  a 
relatively  minimal  role  in  influencing  the  presenC  results.  This  is 
because  the  repeated  measures  design  allowed  the  same  subjects  to 
participate  alternately  on  AU  and  MA  trials.  Thus  the  internal  model 
constructed  during  MA  trials,  if  superior,  was  presumably  also  avail- 
able on  AU  trials.  On  this  basis  it  may  he  hypothesised  that  even 
greater  MA  superiority  might  be  obtained  if  participatory  skmIc  were 
manipulated  as  a between  subjects  vsriable. 

7.  The  difference  between  the  findings  of  Young  and  of  Ephrath  regard- 
ing MA  vs.  AU  superiority  could  li  accounted  for  in  terms  of  the  current 
findings.  The  MA  condition  In  Young's  experiment  and  in  the  current 
one  were  in  many  respects  similar.  A step  change  in  system  order  was 
imposed  following  which  control  adaptation  was  required  entailing  a 
change  in  general  response  characteristics  'operator  describing  function) 
and  therefore  making  available  proprioceptive  inforsiation.  Conversely 
the  failure  employed  by  Ephrath,  a gradual  displacement  or  bias  of  the 
lateral  position,  is  one  for  which  no  fundasMtical  change  in  the 
operator's  transfer  function  was  required  to  adept.  Therefore  proprio- 
ceptive channels  probably  conveyed  little  if  any  infomation  relating 

to  the  occurrence  of  a failure. 

Fimilly,  some  mention  should  be  made  concerning  the  preaeoce  of  individual 
differences.  To  some  extent  these  are  Inevitable,  particularly  in  a task  con- 
figuration at  complex  as  the  current  one,  requiring  dual  taak  performance  in 
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[; 

I the  AU  mode  and  triple  task  performance  In  the  MA.  Given  the  subject's  flexi- 

bility to  allocate  resources  differentially  to  the  two  or  three  tasks,  as  wsll 
^ as  his  ability  to  adapt  various  criteria  on  the  speed-accuracy  detection  bias, 

I it  is  perhaps  somewhat  surprising  that  the  individual  subject  data  in  Figures 

2 and  3 are  as  consistent  as  they  are.  Hevertheleas,  the  importance  is  acknow- 
I ledged  of  acquiring  more  data  to  replicate  and  substantiate  the  trends  reported 

here. 
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Footnotes 


For  the  accurecy  data,  standard  error  of  proportion  scores  were  coaputed 

for  hit  and  false  alam  probabilities  by  the  fonula  S ■ ~ 

p N 

Standard-error  confidence  brackets  were  then  conputed  for  the  accuracy  aeasure 

P(A)  by  deteraining  the  ■axinun  and  ainlaun  accuracy  values  obtainable  when 

P(Hlt}  and  P(False  alam)  were  within  one  standard  error  of  their  respective 

estlaated  values,  e.g. , ■azimua  accuracy:  P(H>  -f  IS  , P(FA)  - IS  ; 

P P 

■Iniaua  accuracy:  P(H)  - IS  , P(FA)  + IS  . 

P P 


